Goto

Collaborating Authors

 Tyrrhenian Sea


A PDE-Informed Latent Diffusion Model for 2-m Temperature Downscaling

Rosu, Paul, Bahng, Muchang, Jiang, Erick, Zhu, Rico, Tarokh, Vahid

arXiv.org Artificial Intelligence

Earth system models (ESMs) are critical for weather forecasting, tracking weather extremes, and supporting impact studies. In particular, numerical weather prediction (NWP) methods track surface and atmospheric data by dissecting the Earth's surface into grids, tracking variables of interest (e.g., temperature, wind speed, direction) as scalar/vector fields, and numerically solving partial differential equations (PDEs) to either physically interpolate into unknown regions or temporally evolve the model--a process known as reanalysis [20, 15]. Historical reanalysis datasets such as ERA5, MERRA-2, and NCEP primarily consist of coarse-scale grid resolutions of 31 31 km to 500 500 km collected by weather stations, aircrafts, and meterological satellites [5, 6, 10]. However, climate simulations at finer resolutions down to 2 2 km are critical for understanding short-term forecasting (nowcasting and medium-range forecasting) and predicting localized weather extremes described by highly resolved fields. As manual collection of such high-resolution data on a global scale is too resource-intensive, global climate models (GCMs) perform downscaling to increase the resolution of surface data by employing two general types of techniques: dynamical and statistical downscaling [3, 11].


Multi-Modal Drift Forecasting of Leeway Objects via Navier-Stokes-Guided CNN and Sequence-to-Sequence Attention-Based Models

Adesunkanmi, Rahmat K., Brandt, Alexander W., Deylami, Masoud, Echeverri, Gustavo A. Giraldo, Karbasian, Hamidreza, Alaeddini, Adel

arXiv.org Artificial Intelligence

Accurately predicting the drift (displacement) of leeway objects in maritime environments remains a critical challenge, particularly in time-sensitive scenarios such as search and rescue operations. In this study, we propose a multi-modal machine learning framework that integrates Sentence Transformer embeddings with attention-based sequence-to-sequence architectures to predict the drift of leeway objects in water. We begin by experimentally collecting environmental and physical data, including water current and wind velocities, object mass, and surface area, for five distinct leeway objects. Using simulated data from a Navier-Stokes-based model to train a convolutional neural network on geometrical image representations, we estimate drag and lift coefficients of the leeway objects. These coefficients are then used to derive the net forces responsible for driving the objects' motion. The resulting time series, comprising physical forces, environmental velocities, and object-specific features, combined with textual descriptions encoded via a language model, are inputs to attention-based sequence-to-sequence long-short-term memory and Transformer models, to predict future drift trajectories. We evaluate the framework across multiple time horizons ($1$, $3$, $5$, and $10$ seconds) and assess its generalization across different objects. We compare our approach against a fitted physics-based model and traditional machine learning methods, including recurrent neural networks and temporal convolutional neural networks. Our results show that these multi-modal models perform comparably to traditional models while also enabling longer-term forecasting in place of single-step prediction. Overall, our findings demonstrate the ability of a multi-modal modeling strategy to provide accurate and adaptable predictions of leeway object drift in dynamic maritime conditions.


DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy

Xu, Kaixuan, Chai, Jiajun, Li, Sicheng, Fu, Yuqian, Zhu, Yuanheng, Zhao, Dongbin

arXiv.org Artificial Intelligence

Diplomacy is a complex multiplayer game that requires both cooperation and competition, posing significant challenges for AI systems. Traditional methods rely on equilibrium search to generate extensive game data for training, which demands substantial computational resources. Large Language Models (LLMs) offer a promising alternative, leveraging pre-trained knowledge to achieve strong performance with relatively small-scale fine-tuning. However, applying LLMs to Diplomacy remains challenging due to the exponential growth of possible action combinations and the intricate strategic interactions among players. To address this challenge, we propose DipLLM, a fine-tuned LLM-based agent that learns equilibrium policies for Diplomacy. DipLLM employs an autoregressive factorization framework to simplify the complex task of multi-unit action assignment into a sequence of unit-level decisions. By defining an equilibrium policy within this framework as the learning objective, we fine-tune the model using only 1.5% of the data required by the state-of-the-art Cicero model, surpassing its performance. Our results demonstrate the potential of fine-tuned LLMs for tackling complex strategic decision-making in multiplayer games.


BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models

Wiland, Jacek, Ploner, Max, Akbik, Alan

arXiv.org Artificial Intelligence

Knowledge probing assesses to which degree a language model (LM) has successfully learned relational knowledge during pre-training. Probing is an inexpensive way to compare LMs of different sizes and training configurations. However, previous approaches rely on the objective function used in pre-training LMs and are thus applicable only to masked or causal LMs. As a result, comparing different types of LMs becomes impossible. To address this, we propose an approach that uses an LM's inherent ability to estimate the log-likelihood of any given textual statement. We carefully design an evaluation dataset of 7,731 instances (40,916 in a larger variant) from which we produce alternative statements for each relational fact, one of which is correct. We then evaluate whether an LM correctly assigns the highest log-likelihood to the correct statement. Our experimental evaluation of 22 common LMs shows that our proposed framework, BEAR, can effectively probe for knowledge across different LM types. We release the BEAR datasets and an open-source framework that implements the probing approach to the research community to facilitate the evaluation and development of LMs.


Sea wave data reconstruction using micro-seismic measurements and machine learning methods

Iafolla, Lorenzo, Fiorenza, Emiliano, Chiappini, Massimo, Carmisciano, Cosmo, Iafolla, Valerio Antonio

arXiv.org Artificial Intelligence

Sea wave monitoring is key in many applications in oceanography such as the validation of weather and wave models. Conventional in situ solutions are based on moored buoys whose measurements are often recognized as a standard. However, being exposed to a harsh environment, they are not reliable, need frequent maintenance, and the datasets feature many gaps. To overcome the previous limitations, we propose a system including a buoy, a micro-seismic measuring station, and a machine learning algorithm. The working principle is based on measuring the micro-seismic signals generated by the sea waves. Thus, the machine learning algorithm will be trained to reconstruct the missing buoy data from the micro-seismic data. As the micro-seismic station can be installed indoor, it assures high reliability while the machine learning algorithm provides accurate reconstruction of the missing buoy data. In this work, we present the methods to process the data, develop and train the machine learning algorithm, and assess the reconstruction accuracy. As a case of study, we used experimental data collected in 2014 from the Northern Tyrrhenian Sea demonstrating that the data reconstruction can be done both for significant wave height and wave period. The proposed approach was inspired from Data Science, whose methods were the foundation for the new solutions presented in this work. For example, estimating the period of the sea waves, often not discussed in previous works, was relatively simple with machine learning. In conclusion, the experimental results demonstrated that the new system can overcome the reliability issues of the buoy keeping the same accuracy.


Thought-provoking and climactic space-related movies that will captivate you through boundless journeys

FOX News

Fox News Flash top entertainment and celebrity headlines are here. The vastness of the universe has always captivated the human imagination, and filmmakers have often looked to the stars for inspiration. Space-related movies have become a genre of their own, offering audiences an opportunity to explore the unknown, experience the thrill of interstellar travel and ponder the profound questions of our existence. These are some of the most iconic and thought-provoking space-theme films that have left a lasting impact on both the science fiction and Hollywood. 'GRAVITY' REVIEW: THERE HAS NEVER BEFORE BEEN MOVIE LIKE THIS From "2001: A Space Odyssey" to "Interstellar" and space survival tales like "Gravity" and "The Martian," Fox News Digital dives into the cinematic cosmos, celebrating their enduring impact on our love for science fiction.


CapsFusion: Rethinking Image-Text Data at Scale

Yu, Qiying, Sun, Quan, Zhang, Xiaosong, Cui, Yufeng, Zhang, Fan, Cao, Yue, Wang, Xinlong, Liu, Jingjing

arXiv.org Artificial Intelligence

Large multimodal models demonstrate remarkable generalist ability to perform diverse multimodal tasks in a zero-shot manner. Large-scale web-based image-text pairs contribute fundamentally to this success, but suffer from excessive noise. Recent studies use alternative captions synthesized by captioning models and have achieved notable benchmark performance. However, our experiments reveal significant Scalability Deficiency and World Knowledge Loss issues in models trained with synthetic captions, which have been largely obscured by their initial benchmark success. Upon closer examination, we identify the root cause as the overly-simplified language structure and lack of knowledge details in existing synthetic captions. To provide higher-quality and more scalable multimodal pretraining data, we propose CapsFusion, an advanced framework that leverages large language models to consolidate and refine information from both web-based image-text pairs and synthetic captions. Extensive experiments show that CapsFusion captions exhibit remarkable all-round superiority over existing captions in terms of model performance (e.g., 18.8 and 18.3 improvements in CIDEr score on COCO and NoCaps), sample efficiency (requiring 11-16 times less computation than baselines), world knowledge depth, and scalability. These effectiveness, efficiency and scalability advantages position CapsFusion as a promising candidate for future scaling of LMM training.


Earthquake Magnitude and b value prediction model using Extreme Learning Machine

Baveja, Gunbir Singh, Singh, Jaspreet

arXiv.org Artificial Intelligence

Earthquake prediction has been a challenging research area for many decades, where the future occurrence of this highly uncertain calamity is predicted. In this paper, several parametric and non-parametric features were calculated, where the non-parametric features were calculated using the parametric features. $8$ seismic features were calculated using Gutenberg-Richter law, the total recurrence, and the seismic energy release. Additionally, criterions such as Maximum Relevance and Maximum Redundancy were applied to choose the pertinent features. These features along with others were used as input for an Extreme Learning Machine (ELM) Regression Model. Magnitude and time data of $5$ decades from the Assam-Guwahati region were used to create this model for magnitude prediction. The Testing Accuracy and Testing Speed were computed taking the Root Mean Squared Error (RMSE) as the parameter for evaluating the mode. As confirmed by the results, ELM shows better scalability with much faster training and testing speed (up to a thousand times faster) than traditional Support Vector Machines. The testing RMSE came out to be around $0.097$. To further test the model's robustness -- magnitude-time data from California was used to calculate the seismic indicators which were then fed into an ELM and then tested on the Assam-Guwahati region. The model proves to be robust and can be implemented in early warning systems as it continues to be a major part of Disaster Response and management.


World's most mysterious text cracked

Daily Mail - Science & tech

For 600 years it has steadfastly refused to give up its secrets and has beaten some of the world's most brilliant brains, including Alan Turing. Experts variously claimed that the Voynich manuscript - known as the'world's most mysterious text' - contained codes, magic spells, alien messages and even communist propaganda. Eventually most agreed that it was either impossible to solve or else written in gibberish as an elaborate practical joke. But a linguistics expert from the University of Bristol has now cracked it - and it took him just two weeks. Dr Gerard Cheshire worked out that it was written in a dead language - proto-Romance - and then by studying symbols and their descriptions he deciphered the meaning of the letters and words.


Closing in on Egypt Air 'black boxes'

BBC News

The Egypt Air disaster may have dropped out of the news briefly, but the investigation continues apace to find out why flight MS804 crashed. French investigators think they have heard locator-beacon signals from at least one of the "black box" flight recorders, and now salvage experts are heading to the site to take a closer look. Hearing the beacons is one thing, but they won't know for sure what they have found until they send down a robotic submarine armed with bright lights and cameras. "Black boxes" are, in fact, bright orange and have reflective strips, so they show up pretty well when you shine lights on them. The robotic submarine is on a special salvage ship, called the John Lethbridge.

  Country:
  Industry: Transportation > Air (1.00)